Presenting a New Dataset for the Timeline Generation Problem
نویسندگان
چکیده
The timeline generation task summarises an entity’s biography by selecting stories representing key events from a large pool of relevant documents. This paper addresses the lack of a standard dataset and evaluative methodology for the problem. We present and make publicly available a new dataset of 18,793 news articles covering 39 entities. For each entity, we provide a gold standard timeline and a set of entityrelated articles. We propose ROUGE as an evaluation metric and validate our dataset by showing that top Google results outperform straw-man baselines.
منابع مشابه
Presenting an evolutionary improved algorithm for the multi-objective problem of distribution network reconfiguration in the presence of distributed generation sources and capacitor units with regard to load uncertainty.
Reconfiguration of distribution network feeders is one of the well-known and effective strategies in the distribution network to obtain a new optimal configuration for the distribution feeders by managing the status of switches in the distribution network. This study formulates the multi-objective problem of reconfiguration of a distribution network in the optimal presence of distributed genera...
متن کاملScheduling security constraint unit commitment for power system including stochastic wind power generation
This paper introduces a new approach for scheduling security constraint unit commitment (SCUC) including wind farms. Because of uncertainty in wind power production, we tried to develop a new method for incorporating wind power generation in power plant scheduling. For this, wind power generation modeled with unit commitment in a non-linear optimization problem and simulated by submitting diffe...
متن کاملPresenting a method for extracting structured domain-dependent information from Farsi Web pages
Extracting structured information about entities from web texts is an important task in web mining, natural language processing, and information extraction. Information extraction is useful in many applications including search engines, question-answering systems, recommender systems, machine translation, etc. An information extraction system aims to identify the entities from the text and extr...
متن کاملA Fuzzy Multi Objective Programming Model for Power Generation and Transmission Expansion Planning Problem
The increasing consumption of electricity over time forces different countries to establishnew power plants and transmission lines. There are various crisp single-objective mathematicalmodels in the literature for the long-term power generation and transmission expansion planning tohelp the decision makers to make more reasonable decisions. But, in practice, most of the parametersassociated wit...
متن کاملAssessment Methodology for Anomaly-Based Intrusion Detection in Cloud Computing
Cloud computing has become an attractive target for attackers as the mainstream technologies in the cloud, such as the virtualization and multitenancy, permit multiple users to utilize the same physical resource, thereby posing the so-called problem of internal facing security. Moreover, the traditional network-based intrusion detection systems (IDSs) are ineffective to be deployed in the cloud...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2016